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INVARIANTS OF COMMUTING MATRICES 


C. PROCESI 


1. An open question 

1.1. Introduction. Consider the space of pairs (V, T) of n x n matrices over a field F 
of characteristic 0, and denote by Rn{X,Y) the polynomial algebra on this space. 

An intriguing, difficult open question in commutative algebra asks wether the ideal 
In, generated by the entries of the equation [X, Y] = XY — YX = 0 is a prime ideal in 
Rn{X, Y). Since it is known that the variety of pairs of commuting matrices is irreducible 
(cf. [1]) this means that the ideal of this variety is generated by these natural quadratic 
equations. 

Several attempts to this have been made but, as far as I understand this issue, no 
conclusive proof has been reached except for small n < 4 (Wallach private communication). 
In [3] the authors study a related problem, that is they study pairs of matrices with 
commutator of rank 1. 

From this they deduce that, for the case of 2 commuting matrices, the ring of invariants 
of the natural action by conjugation on i?„ (A, T)//„ is an integral domain. The purpose of 
this paper is to comment two papers, one by Domokos [2] and one of Francesco Vaccarino 
m where they prove the same statement but for the algebra of m-commuting matrices, 
for all m. This is in fact a consequence of a stronger theorem[Tl which gives an isomorphism 
of this ring of invariants with the invariants of diagonal matrices under the symmetric 
group. The method of proof in the two papers is quite different. The two papers are 
clearly independent and although published with some time gap I have been informed 
that the two preprints appeared on the archive more or less at the same time. 

The approach of Vaccarino is through polynomial maps and the one of Domokos via 
classical invariant theory. 

I want to point out some simply related approach which may apply to classical Lie 
algebras, since the same question can be asked for all simple Lie algebras (cf. [5], [10] 
and [12| 1 but I have made no attempt in this direction. It also gives some supplementary 
information on these algebras. 

1.2. Commuting matrices. We work over a field F of characteristic 0, for instance Q. 
Let us fix a positive integer n, for all m in fact for to = oo let us consider the ring of 
polynomials in the entries j,, h, k = 1,... ,n, i = 1,... , to of to, n x n matrix variables 
Xi := {x]^ j.), modulo the ideal generated by the entries of the commuting equations 
[Xi,Xj] = 0. Denote by An^m this ring and An = An,oc- On this ring acts the linear 
group GL{m,F) by linear transformations on the variables Xi and the group GL{n,F) 
by conjugation; the two actions commute. 
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One may consider in the ring Mn{An,m) the m generic commuting matrices images 
of the generic matrices Xi. 

Consider now the ring Bn,m (Bm = Bn,oo) of polynomials in m vector variables Xi = 
(ccip,..., Xi^n) which we also consider as m diagonal matrices. On this ring acts the linear 
group GL{m,F) by linear transformations on the variables Xi and the symmetric group 
Sn by conjugation; the two actions commute. We can also identify i?n,m = B®"^ where 
Bn = F[xi ^..., Xn\ is the polynomial ring in n variables. 

We have a natural restriction map tt : An^m Bn,m compatible under the action of 
GL{m,F) and S'™, which induces thus a natural map of invariants 

(1) it : ^ B^^ = (F[xi,..., xn^f-f-. 

Theorem 1. [Domokos, Vaccarino] The map it is an isomorphism. 

Proof. The proof from [5] is similar to the proof I want to present, the one in m is based 
on the Theory of polynomial maps. 

One first considers the isomorphism F[xi,..., Xm] F[:fi,..., to the ring of generic 
commuting matrices, when we compose this map with the determinant we have a poly¬ 
nomial map, homogeneous of degree n: 

(2) D : F[xi,. ..,xn,]^ F{iu ... ^ 

This map is multiplicative in the sense that D{ab) = D{a)D{b), hence by a general 
theorem of Roby, [5] it factors through a homomorphism of the symmetric tensors 
D : ..., —>■ An^^’^^. So the point is just to prove that this map is 

inverse to the restriction. The composition it o D is clearly identity, so D is injective and 
TT surjective the issue is in the other order D o tt. In m this is proved by showing a 
correspondence between generators of the two algebras. For matrix invariants it is known 
(even before imposing the commutative law), that invariants are generated by the coeffi¬ 
cients of the characteristic polynomials of primitive monomials in the variables Xi, see [I], 
a similar statement holds for the symmetric group and one verifies the correspondence. 

Here I want to point out a slightly different approach, which consists in applying the 
classical Arhonold method, of reducing to multilinear elements. As I will point out at 
the end this also gives some normal form for the invariants. Remark that tt is GL(m,F) 
equivariant for all m, thus it is enough to prove the statement for m = oo and, by 
the classical method of Arhonold, it is enough to prove that it is an isomorphism when 
restricted to the multilinear elements. 

We thus have to gather some information on these elements. 

Let us start with the multilinear elements of Bnfn- The multilinear elements of Bn^m 
have as basis the monomials 

^1,21 • ■ ^ H 

Thus these monomials can be thought of as maps from [m] := {1, 2,..., m} to [n]. 

These monomials are permuted by the symmetric group so that the space of multilinear 
elements of B^’i^ has as basis the orbits of on such a space of functions. 

To any function / : [m] —i [n], is associated the partition of [m] into its fibers which 
is a partition of [m] into at most n subsets. This partition is independent of the orbit. 
Conversely given a partition A of [m\ into at most n subsets, we have a natural ordering 
of these sets by associating to each subset its minimal element and then ordering them 
according to the minimal element as Sx, S 2 ,..., Sk- We then associate to this the function 
/a : [m] [n] which takes the value i on Si. In its orbit under Sn this function is leading in 
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a suitable lexicographic order. Hence it is easily seen that in this way we have established 
a 1-1 correspondence between maps / : [m] —^ [n] up to symmetry and decompositions A 
of [m] into at most n subsets. 

Let us now turn our attention to ■ By classical invariant theory multilinear 

invariants of m matrix variables are spanned by the functions (pa where cr £ Sm is a 
permutation and 

pa{xi,X2, . ■ ■,Xm) = tr{xi^Xi^ . . . Xi^)tr{Xj^Xj^ . ..Xj^). ..tr{Xs.,Xs2 ■ --Xs^) 
there the various factors correspond to the decomposition into cycles of a (see for instance 

IS)- 

When we are working with n x n matrices we have a basic formal identity (equivalent 
to the Cayley-Hamilton identity) 

eapaixi,X 2 ,...,Xn+l) =0. 

crgS„+i 

We deduce from this that 

n+l 

pl{xi,X2,...,Xn+l)=Wtr{xi) = - y] eaPa{xi,X2,...,Xn+l) 

i=l (7G5n + l, Cr/l 

and then each summand of the R.H.S. of this equality is the product of at most n factors 
of the form tr{M) for M some monomial. 

Therefore by a repeated use of this identity we deduce that every invariant for n x n 
matrices is a linear combination of products with at most n factors of the form tr{M) for 
M some monomial. 

We apply this in particular to the multilinear invariants for commuting matrices, in 
this case the invariant tr[M) for a multilinear monomial M depends only on the variables 
appearing in M and not the order in which they appear. 

Given a set S of indices we may define thus an element trs £ An 

trs ■■= tr{W_Xi). 
ies 

Thus, given a partition A := {Si,..., S^}, k < n oi [m] into at most n subsets Si 
and considering the invariant “ Hi ^Si we have that these multilinear elements span 
linearly all multilinear invariants. Thus we can prove that they form a basis and that tt 
is an isomorphism as soon as we show that tt is surjective (proved already in |12|'). 

For this order lexicographically the monomials •. • Xm,i„ which can be just thought 
of as words of length m in the letters 1, 2,..., n. Observe that the leading term of t\ is 
the monomial associated to /a and the claim follows. □ 

Let us draw some consequences, let J be the nil ideal of clearly it is stable under 
both groups and we have 

Corollary 1.3. J does not contain any element which is invariant under GL{n,F). 

We should notice that if one is interested in describing these algebra, in detail, one 
method is to describe them as representation of GL(rn) (m is the number of copies) and 
this is done by describing each homogeneous component of degree k as a, direct sum of 
Schur functors S\{F'^) where A h A: is a partition of k. 


4 


C. PROCESI 


The description of homogeneous component of degree k passes through the descrip¬ 
tion of the multilinear part of degree k for the case of k copies as representation of the 
symmetric group implies the description of the homogeneous part of the same degree as 
representation of the linear group GL{k). 

In fact by the Theory of Schur functors, a partition A h fc defines a Schur functor S'a(IT) 
on vector spaces homogeneous of degree k. When we apply this to an fc-dimensional space 
W = with a prescribed basis we may consider inside the space Mx{F^) of multilinear 
elements, which by definition are the invariants for the torus of diagonal matrices in that 
basis with determinant 1. It then is true that M\{F^) is the irreducible representation of 
the symmetric group 5'fc, permuting that basis, associated to the same partition A. 

Of course S'A(bT) = 0 if the dimension of W is strictly smaller than the height of A 
so when m < k not all the representations appearing contribute. In our case it would 
remain to describe the decomposition into irreducible representations of the permutation 
representations, which we have seen describe the multilinear invariants, of decompositions 
A of [to] into at most n subsets. The orbits of this permutation action are parametrized 
by partitions of to of height < n, to each such partition corresponds the permutation 
representations given on cosets of the corresponding Young subgroup. These are well 
understood, there is an extensive literature, see for instance [9]. 
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